CRF Errors Corrected by the STIC Systems Branch

□ Changed a file from non-ASCII to ASCII. Verified by: [signature illegible]

□ Changed the margins in cases where the sequence text was "wrapped" down to the next line.

□ Edited a format error in the Current Application Data section, specifically: [handwritten entry illegible]

□ Edited the Current Application Data section with the actual current number. The number inputted by the
applicant was □ the prior application data; or □ other.

□ Added the mandatory heading and subheadings for "Current Application Data".

□ Edited the "Number of Sequences" field. The applicant spelled out a number instead of using an integer.

□ Changed the spelling of a mandatory field (the headings or subheadings), specifically:

□ Corrected the SEQ ID NO when obviously incorrect. The sequence numbers that were edited were:

□ Inserted or corrected a nucleic number at the end of a nucleic line. SEQ ID NO's edited:

□ Corrected subheading placement. All responses must be on the same line as each subheading. If the
applicant placed a response below the subheading, this was moved to its appropriate place.

□ Inserted colons after headings/subheadings. Headings edited included:

□ Deleted extra, invalid, headings used by an applicant, specifically:




□ Deleted: □ non-ASCII "garbage" at the beginning/end of files; □ secretary initials/filename at end of file;
□ page numbers throughout text; □ other invalid text, such as:

□ Inserted mandatory headings, specifically:



□ Corrected an obvious error in the response, specifically:



□ Edited identifiers where upper case is used but lower case is required, or vice versa.

□ Corrected an error in the Number of Sequences field, specifically:

□ A "Hard Page Break" code was inserted by the applicant. All occurrences had to be deleted.

□ Deleted ending stop codon in amino acid sequences and adjusted the "(A)Length:" field accordingly (error
due to a PatentIn bug). Sequences corrected:

□ Other:



Examiner: The above corrections must be communicated to the applicant in the first Office
Action. DO NOT send a copy of this form. 3/1/95
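
Two of the checks listed on the form above (an integer in the "Number of Sequences" field, and a colon after every heading or subheading) can be illustrated with a minimal Python sketch. This is not the STIC Systems Branch's actual tooling; the function name `check_general_information` and the regular expression are assumptions made for this example, and only the field labels come from the listing itself.

```python
import re

# Matches a heading tag such as "(iii)" or "(A)" at the start of a line.
# The pattern is an assumption for this sketch, not an official grammar.
TAG_RE = re.compile(r"^\s*\(([ivx]+|[A-Z])\)\s*(.*)$")


def check_general_information(lines):
    """Return (line_number, message) pairs for simple format problems."""
    problems = []
    for number, line in enumerate(lines, start=1):
        match = TAG_RE.match(line)
        if not match:
            continue  # not a heading or subheading line
        rest = match.group(2)
        if ":" not in rest:
            problems.append((number, "heading has no colon"))
            continue
        # Split on the first colon only, so values such as times stay intact.
        label, value = (part.strip() for part in rest.split(":", 1))
        if label.upper() == "NUMBER OF SEQUENCES" and not value.isdigit():
            problems.append((number, "NUMBER OF SEQUENCES must be an integer"))
    return problems


if __name__ == "__main__":
    sample = [
        "(iii) NUMBER OF SEQUENCES: three",
        "(A) ADDRESSEE Genencor International, Inc.",
        "(B) FILING DATE: 23-JUN-1997",
    ]
    for number, message in check_general_information(sample):
        print(number, message)
```

Empty responses (as under PRIOR APPLICATION DATA) are deliberately not flagged, since the listing leaves those fields blank.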




RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/876,132 



TIME: 10:17:14 




INPUT SET: S28494.raw



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING

General Information

(i) APPLICANT: Fowler, Timothy
Stuart, Causey

(ii) TITLE OF THE INVENTION: ENTEROBACTERIACEAE FERMENTATION
STAINS

(iii) NUMBER OF SEQUENCES: 3

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Genencor International, Inc.

(B) STREET: 925 Page Mill Road

(C) CITY: Palo Alto

(D) STATE: CA

(E) COUNTRY: US

(F) ZIP: 94304-1013

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Diskette

(B) COMPUTER: IBM Compatible

(C) OPERATING SYSTEM: DOS

(D) SOFTWARE: FastSEQ for Windows Version 2.0

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 08/876,132

(B) FILING DATE: 23-JUN-1997

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Glaister, Debra J.

(B) REGISTRATION NUMBER: 33,888

(C) REFERENCE/DOCKET NUMBER: GC372

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 650-846-7620

(B) TELEFAX: 650-845-6504



(2) INFORMATION FOR SEQ ID NO:1:



PAGE: 2 RAW SEQUENCE LISTING DATE: 09/16/98

PATENT APPLICATION US/08/876,132 TIME: 10:17:15

INPUT SET: S28494.raw

47 (i) SEQUENCE CHARACTERISTICS:

48 (A) LENGTH: 1660 base pairs

49 (B) TYPE: nucleic acid

50 (C) STRANDEDNESS: single

51 (D) TOPOLOGY: linear
52

53 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

54

55 AGATCTACAC AAGGCAAATT GAAAAAATAG ATAAAATTTT CGCAGGTATT AAAGCCGACT 60

56 TAAAACAAAT GAGTGAAGAA GAAAAGAAAA AAATAAATAC ATATTTTGAG TTAGTAAAAG 120

57 AGAAAGAAAA AATAAAAGAA GACCTCGGCT TAACAGTCGA AAAACCAGAA ATAATAAAAA 180

58 GAAAGAGACT GTGATTTTTA ATGGAAATCG TGAGGAAAAG AAAATTTTAA TTTTCATTTT 240

59 CGAGGGATTA ATTTGTTGTA AGTTGATGAA AAATCTAGAT AAAAAATGCA GATCAAAAAT 300

60 GTGTTGAATT TGACATTATT GAAATACGTA GTATATCAAT AATGGGGGTT TGTCTATTTT 360

61 ATTTTGCGAA GATTGAAAAT CTGAGTGAAA GAAAATAGTT TGCGAGAGCA AAAAAACCCT 420

62 TGCCGTTTTT TTCAAATGAC TTTGGAAAAA ATTCATTGTG AGCGGTAGCG AAACTTTGAA 480

63 ATTTTTTACA TTGGAAATTT GAAAAAATAA GGCAAAAGAA ACTCAAATGG AAAAAATATT 540

64 ATTATAAAAA AAGGAGATCG GATATGGATT TTAAAAGCAG AAAACTGACA TTGAATGAAA 600

65 AAAAAGATTT GGAAAAAATC TATGCTGAGA GTGAATTAAA AGCAAAAAAA TTGGGAACTC 660

66 AACCCGGTGT TGTTTTAGAA ATGACGATGA AAGAAATGAT GAAAAATATC AACCTCGATG 720

67 TTAATGAAGA AACAGCAGGT CAATATAGGA AATTATTCAA AAATAAAGTT GAGCATAGTA 780

68 AATCAGATGA TCTAGTAACG GGACTATTAG AGTGTGGAAC TCGAAATAGT TTTGATAAAA 840

69 CAAGAAGTGC CTTTCGTTTT TGTATTTGTG AGAGAATTCA GCAACTGAGA AAAGAAGCTG 900

70 ATAATGCAAG AAGAGTAAAA GATTTCGATA CAATGAAAGC AAAAACTAAA GAGGCTTTTG 960

71 AATTGAGTTT TGTTTTTGAT AAGGATTTTT TGAGTGAAAA TAGAATTCAA TGGAATGATA 1020

72 TTTCTCACAA CAAAAAAGAC TCTGCAAGTA AAAGAAAAAC AATGAAAGAA GCGGACACAA 1080

73 TGGATGATAT TTTTAAGAGG CTAAAAAATA ATAAATCTAC ATATGATCGT TATGCTGGAT 1140

74 TCCTTTCTAT TTGTTCGATT ACAGGTTGCA GACCAGCAGA AGTTTTAAAG GGTATAGAGA 1200

75 TAGTAAGAAA CAGATATGAG GATGGTATAT CTTTTAAAAT ACTTGGTGCA AAGGTTGGAA 1260

76 ATGACAGAGG GCAAAGCGAA AGAACATTAC ATTTTGATTT ATCAAAATAT CATGATAATG 1320

77 AGCAAATGAA TTATATTTTG TCGCAATTAA AAGATAATAA ATTTTTCTAC AAACCAGATG 1380

78 GGAAGCTCTA CAACAGCTTG AGGCAATACC TCTACATCCA ACATAGAACG TTTTCACTGT 1440

79 ATACACTTCG TCACAGGGTT GCGAGTGATC TCAAGGCATC CGGTGCAGAT GACTTCACCA 1500

80 TAGCGGCTNT TTTGGGTCAC AGAGTGACTC AAAGCCAGGA GTTACTACGG CTATGCTCGT 1560

81 TCGTCGNAAG GTGGTATCGC TGTAACTGGT GTTGAGTGCT CTGATGTTGT GAAAGCAAAC 1620

82 AAGAGTCAGT TNGCTGTATC AAGGACTCCG AGCCAGATCT 1660
83

84 (2) INFORMATION FOR SEQ ID NO: 2:

85

86 (i) SEQUENCE CHARACTERISTICS:

87 (A) LENGTH: 1847 base pairs

88 (B) TYPE: nucleic acid

89 (C) STRANDEDNESS: single

90 (D) TOPOLOGY: linear
91

92

93 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

94

95 AGATCTCAAC CAGTTTAAAA TCGCACTTCA AGAAGTAAAA ATAGGGGCCG GCACCGGCTC 60

96 TTTTTTTGGT GTTTTTGTAG TTAGTGGATA TATCTGTTAG CTACAGAGAA AAGCGATTTT 120

97 AGAGGGTTTG ACGAGGTTTT TTCGAGCTAT CCAGGGTTTT TGGGTTTTTG GGGTTGGATC 180

98 AGAAAAGTCG TTCAAGATTA TTGACATAAA GACAGGAAGG TTTATAACAA GTACCAGATA 240

99 CGACAAAACC AGCTTTGCAG GCTGGCTTTG AAGGACTAAA AGAAGTGGGG ACTTCTTTGA 300



PAGE: 3 RAW SEQUENCE LISTING DATE: 09/16/98

PATENT APPLICATION US/08/876,132 TIME: 10:17:17 

INPUT SET: S28494.raw 

100 GTCTTGTAAT CAAGTTGGTC AGAACTCGAT TACGATTTGT AAGTAGAAAT CTAACTCACA 360 

101 TTTCGCAGAA AGTCAAACTT ACCTCTTAGT TACAACTCAA AAATTTCCTA GCCTTTTCAG 420 

102 ATCCTTAAGC ATACATATTT TGTTTAAACC GATTGTGTCC GGTGTTTGGT GTGGAGCCAT 480 
103 TGATCCGAGT GGTCAATATG TGATTGTTCG CCAAACAGTG TATGTAGGTC TAAACGGGGA 540

104 GTGCTACAAA AGACCATACC CGAAACGAGT GCCTAAGTGT TTTGGTTATC AACCAGGTAA 600

105 GCTATGAGAA AGCCCAGCCA TAAATGGGGT TAGGTTGAAG CAAGTCTTCA TATGGTGCGA 660 

106 CACAAGGGGT GTAGTAGGGT GTCGTCAAAC TGAAAGGTTT GATAGCTCTA AGCTTGTGCT 720 

107 TCTGTGGGTC AAGCCTCAAG TGCTGATCTG TGGTGTCGTC TACCTGATAA CTTTCACTTT 780 

108 TTCGAGTGAA ATTCAGGAGG CGAAACTATG GGTCAAGCCC AGCTTTGCTG GGGTTCGGCA 840 

109 CATCCAGCTT ACAGCATTGG TGCTCTTGCG AAGCTGAAGC ACAAAAATCT AATCCAGGGT 900 

110 TTGGGTTTTT TATACCAGAA GCAAAACAAA AAAATAAAAC AAAGAAAAAT TTTCGAGCGA 960 

111 AAAAATATTT TGGAATTTTT TAAAGGCGAT ACTTGCTACC GCACTTTTGC CATATTTAAA 1020 

112 ACCTGACTAT CTTTATAAGT TAATAGATAT ATCCGTTAGA TTATAAAGTA TGTTAAAAAC 1080 

113 GAGTAAAAAC AATAACTTAT ATATTTAATT CTGAATTATA TTTGACAGTG ATTATTTAAT 1140 

114 ATATTAAGAG ATATATCTAT TAGCTTAAAT ATAACTAAAA AAAGAGGTAA ATATATGGAT 1200 

115 TGTGTATTTA AAAAAGCATT AGAAAATGAA ATAGAACATT ATAAAAAAGA CGGTGATATC 1260 

116 AAATCTTTCT TACAATACTT GCATTACTTT GATATAGATA AAGCATTAAA TGGTGATGAA 1320

117 TGTGGCGATA TTATAAACTC AAATTTATCC ATTGATGAAA GTTTTGATCT TCTTGATGTT 1380 

118 GAGCACAATT TCGGCTGGGC TTTCAATAAA ATAATACAGA GACGAAATGA ATATTTATCA 1440 

119 TCAGCTAAAA CTGAAAATGA TTTTAAAAAA TACTCGTTCT TTATTCATTC GATCAATTGG 1500 

120 GAAGAATTTA ATTACGATGA GATGAGTACA ATACATCAAG AAATGATTAA AGGATTAGAT 1560

121 AATTACACAT ATGGAGAAAT AACCATATGA ATAATAAAAT AAGAGAATAT ATTGATTTCG 1620 

122 AAATAACAAA AGATATAAAA GAAAGTCAGC TCTTAAAAAT ATCTGCATTG ATCGATGTTT 1680 
123 TAAAAGTAGA TGAAAAATTT ATTGATGAAG AGGATTTGCA ACTAAAGATA TTGAAAATAT 1740

124 CGTATGAAAA TCCTATTGAT GATCCAGATG ATGGCATAAG AAAATCACAA TTCGCACGAA 1800 

125 GAAATGCCTA TGCTTTCCGC ATTAAAAAAA CAAGCAAAAA GAGATCT 1847 
126 

127 (2) INFORMATION FOR SEQ ID NO: 3: 
128 

129 (i) SEQUENCE CHARACTERISTICS:

130 (A) LENGTH: 371 amino acids

131 (B) TYPE: amino acid

132 (C) STRANDEDNESS: single

133 (D) TOPOLOGY: linear
134 

135 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
136 

137 Asn Phe Leu His Trp Lys Phe Glu Lys Ile Arg Gln Lys Lys Leu Lys

138 1 5 10 15
139

140 Trp Lys Lys Tyr Tyr Tyr Lys Lys Arg Arg Ser Asp Met Asp Phe Lys

141 20 25 30
142

143 Ser Arg Lys Leu Thr Leu Asn Glu Lys Lys Asp Leu Glu Lys Ile Tyr

144 35 40 45
145

146 Ala Glu Ser Glu Leu Lys Ala Lys Lys Leu Gly Thr Gln Pro Gly Val

147 50 55 60
148

149 Val Leu Glu Met Thr Met Lys Glu Met Met Lys Asn Ile Asn Leu Asp
150 65 70 75 80
151

152 Val Asn Glu Glu Thr Ala Gly Gln Tyr Arg Lys Leu Phe Lys Asn Lys



PAGE: 4 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/876,132



DATE: 09/16/98 
TIME: 10:17:19 



INPUT SET: S28494.raw
153 85 90 95 

154 

155 Val Glu His Ser Lys Ser Asp Asp Leu Val Thr Gly Leu Leu Glu Cys 

156 100 105 110 
157 

158 Gly Thr Arg Asn Ser Phe Asp Lys Thr Arg Ser Ala Phe Arg Phe Cys 

159 115 120 125 
160 

161 Ile Cys Glu Arg Ile Gln Gln Leu Arg Lys Glu Ala Asp Asn Ala Arg

162 130 135 140
163

164 Arg Val Lys Asp Phe Asp Thr Met Lys Ala Lys Thr Lys Glu Ala Phe

165 145 150 155 160
166

167 Glu Leu Ser Phe Val Phe Asp Lys Asp Phe Leu Ser Glu Asn Arg Ile

168 165 170 175
169

170 Gln Trp Asn Asp Ile Ser His Asn Lys Lys Asp Ser Ala Ser Lys Arg

171 180 185 190
172

173 Lys Thr Met Lys Glu Ala Asp Thr Met Asp Asp Ile Phe Lys Arg Leu

174 195 200 205
175

176 Lys Asn Asn Lys Ser Thr Tyr Asp Arg Tyr Ala Gly Phe Leu Ser Ile

177 210 215 220
178

179 Cys Ser Ile Thr Gly Cys Arg Pro Ala Glu Val Leu Lys Gly Ile Glu

180 225 230 235 240
181

182 Ile Val Arg Asn Arg Tyr Glu Asp Gly Ile Ser Phe Lys Ile Leu Gly

183 245 250 255

184 Ala Lys Val Gly Asn Asp Arg Gly Gln Ser Glu Arg Thr Leu His Phe

185 260 265 270
186

187 Asp Leu Ser Lys Tyr His Asp Asn Glu Gln Met Asn Tyr Ile Leu Ser

188 275 280 285
189

190 Gln Leu Lys Asp Asn Lys Phe Phe Tyr Lys Pro Asp Gly Lys Leu Tyr

191 290 295 300
192

193 Asn Ser Leu Arg Gln Tyr Leu Tyr Ile Gln His Arg Thr Phe Ser Leu

194 305 310 315 320
195

196 Tyr Thr Leu Arg His Arg Val Ala Ser Asp Leu Lys Ala Ser Gly Ala

197 325 330 335
198

199 Asp Asp Phe Thr Ile Ala Ala Xaa Leu Gly His Arg Val Thr Gln Ser

200 340 345 350
201

202 Gln Glu Leu Leu Arg Leu Cys Ser Phe Val Xaa Arg Trp Tyr Arg Cys

203 355 360 365 
204 

205 Asn Trp Cys 



PAGE: 5 RAW SEQUENCE LISTING DATE: 09/16/98 

PATENT APPLICATION US/08/876,132 TIME: 10:17:20 

INPUT SET: S28494.raw 

206 370 

207 

208 

209 
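
Each nucleotide line in the listing above ends with a running base count (60, 120, ... 1660), and each sequence declares its size in the "(A) LENGTH:" field. The sketch below shows one way the two could be cross-checked in Python. `check_running_counts` is a hypothetical helper written for this illustration; it assumes the leading listing line numbers have already been removed, and it is not part of any USPTO or PatentIn software.

```python
def check_running_counts(lines, declared_length):
    """Return a list of messages; an empty list means the block is consistent."""
    problems = []
    total = 0
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines between sequence rows
        # Everything except the last field is sequence; the last is the count.
        *groups, count = fields
        total += sum(len(group) for group in groups)
        if not count.isdigit() or int(count) != total:
            problems.append(
                f"count {count!r} does not match {total} bases read so far"
            )
    if total != declared_length:
        problems.append(
            f"declared LENGTH {declared_length} but {total} bases found"
        )
    return problems


if __name__ == "__main__":
    # Short illustrative rows, not a full sequence from the listing.
    rows = ["AGATCTACAC AAGGCAAATT 20", "", "GAAAAAATAG ATAAA 35"]
    print(check_running_counts(rows, 35))
```

A count split by stray whitespace (for example "2 0" instead of "20") is reported, because the stray digit is read as part of the sequence and the final field no longer matches the running total.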



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/08/876,132 



DATE: 09/16/98 
TIME: 10:17:20 



INPUT SET: S28494.raw 



Original Text 



PAGE: 1 



1802 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/876,132 



DATE: 09/09/98 
TIME: 15:03:05 



INPUT SET: S28494.raw 



This Raw Listing contains the General
Information Section and up to the first 5 pages.

SEQUENCE LISTING



(1) 



General Information 



(i) APPLICANT: Fowler, Timothy 
Stuart, Causey 



Does Not Comply
Corrected Diskette Needed



(ii) TITLE OF THE INVENTION: ENTEROBACTERIACEAE FERMENTATION 

STAINS 

(iii) NUMBER OF SEQUENCES: 3 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genencor International, Inc. 

(B) STREET: 925 Page Mill Road 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 94304-1013 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/876,132 

(B) FILING DATE: 23-JUN-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Glaister, Debra J. 

(B) REGISTRATION NUMBER: 33,888 

(C) REFERENCE/DOCKET NUMBER: GC372

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650-846-7620 

(B) TELEFAX: 650-845-6504 



(2) INFORMATION FOR SEQ ID NO:1:



PAGE: 2 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/876,132 



DATE: 09/09/98 
TIME: 15:03:05 



INPUT SET: S28494.raw 

47 (i) SEQUENCE CHARACTERISTICS:

48 (A) LENGTH: 1660 base pairs

49 (B) TYPE: nucleic acid

50 (C) STRANDEDNESS: single

51 (D) TOPOLOGY: linear
52

53 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

54 

55 AGATCTACAC AAGGCAAATT GAAAAAATAG ATAAAATTTT CGCAGGTATT AAAGCCGACT 60 

56 TAAAACAAAT GAGTGAAGAA GAAAAGAAAA AAATAAATAC ATATTTTGAG TTAGTAAAAG 120 

57 AGAAAGAAAA AATAAAAGAA GACCTCGGCT TAACAGTCGA AAAACCAGAA ATAATAAAAA 180 

58 GAAAGAGACT GTGATTTTTA ATGGAAATCG TGAGGAAAAG AAAATTTTAA TTTTCATTTT 240 

59 CGAGGGATTA ATTTGTTGTA AGTTGATGAA AAATCTAGAT AAAAAATGCA GATCAAAAAT 300

60 GTGTTGAATT TGACATTATT GAAATACGTA GTATATCAAT AATGGGGGTT TGTCTATTTT 360 

61 ATTTTGCGAA GATTGAAAAT CTGAGTGAAA GAAAATAGTT TGCGAGAGCA AAAAAACCCT 420 

62 TGCCGTTTTT TTCAAATGAC TTTGGAAAAA ATTCATTGTG AGCGGTAGCG AAACTTTGAA 480 

63 ATTTTTTACA TTGGAAATTT GAAAAAATAA GGCAAAAGAA ACTCAAATGG AAAAAATATT 540

64 ATTATAAAAA AAGGAGATCG GATATGGATT TTAAAAGCAG AAAACTGACA TTGAATGAAA 600 

65 AAAAAGATTT GGAAAAAATC TATGCTGAGA GTGAATTAAA AGCAAAAAAA TTGGGAACTC 660 

66 AACCCGGTGT TGTTTTAGAA ATGACGATGA AAGAAATGAT GAAAAATATC AACCTCGATG 720 

67 TTAATGAAGA AACAGCAGGT CAATATAGGA AATTATTCAA AAATAAAGTT GAGCATAGTA 780 

68 AATCAGATGA TCTAGTAACG GGACTATTAG AGTGTGGAAC TCGAAATAGT TTTGATAAAA 840 

69 CAAGAAGTGC CTTTCGTTTT TGTATTTGTG AGAGAATTCA GCAACTGAGA AAAGAAGCTG 900

70 ATAATGCAAG AAGAGTAAAA GATTTCGATA CAATGAAAGC AAAAACTAAA GAGGCTTTTG 960 

71 AATTGAGTTT TGTTTTTGAT AAGGATTTTT TGAGTGAAAA TAGAATTCAA TGGAATGATA 1020 

72 TTTCTCACAA CAAAAAAGAC TCTGCAAGTA AAAGAAAAAC AATGAAAGAA GCGGACACAA 1080 

73 TGGATGATAT TTTTAAGAGG CTAAAAAATA ATAAATCTAC ATATGATCGT TATGCTGGAT 1140

74 TCCTTTCTAT TTGTTCGATT ACAGGTTGCA GACCAGCAGA AGTTTTAAAG GGTATAGAGA 1200 

75 TAGTAAGAAA CAGATATGAG GATGGTATAT CTTTTAAAAT ACTTGGTGCA AAGGTTGGAA 1260 

76 ATGACAGAGG GCAAAGCGAA AGAACATTAC ATTTTGATTT ATCAAAATAT CATGATAATG 1320 

77 AGCAAATGAA TTATATTTTG TCGCAATTAA AAGATAATAA ATTTTTCTAC AAACCAGATG 1380 

78 GGAAGCTCTA CAACAGCTTG AGGCAATACC TCTACATCCA ACATAGAACG TTTTCACTGT 1440 

79 ATACACTTCG TCACAGGGTT GCGAGTGATC TCAAGGCATC CGGTGCAGAT GACTTCACCA 1500 

80 TAGCGGCTNT TTTGGGTCAC AGAGTGACTC AAAGCCAGGA GTTACTACGG CTATGCTCGT 1560 

81 TCGTCGNAAG GTGGTATCGC TGTAACTGGT GTTGAGTGCT CTGATGTTGT GAAAGCAAAC 1620 

82 AAGAGTCAGT TNGCTGTATC AAGGACTCCG AGCCAGATCT 1660
83 

84 (2) INFORMATION FOR SEQ ID NO: 2: 

85 

86 (i) SEQUENCE CHARACTERISTICS: 

87 (A) LENGTH: 1847 base pairs 

88 (B) TYPE: nucleic acid 

89 (C) STRANDEDNESS: single 

90 (D) TOPOLOGY: linear 
91 

92 

93 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

94 

95 AGATCTCAAC CAGTTTAAAA TCGCACTTCA AGAAGTAAAA ATAGGGGCCG GCACCGGCTC 60 

96 TTTTTTTGGT GTTTTTGTAG TTAGTGGATA TATCTGTTAG CTACAGAGAA AAGCGATTTT 120 

97 AGAGGGTTTG ACGAGGTTTT TTCGAGCTAT CCAGGGTTTT TGGGTTTTTG GGGTTGGATC 180 

98 AGAAAAGTCG TTCAAGATTA TTGACATAAA GACAGGAAGG TTTATAACAA GTACCAGATA 240 
99 CGACAAAACC AGCTTTGCAG GCTGGCTTTG AAGGACTAAA AGAAGTGGGG ACTTCTTTGA 300



PAGE: 3 RAW SEQUENCE LISTING DATE: 09/09/98

PATENT APPLICATION US/08/876,132 TIME: 15:03:06

INPUT SET: S28494.raw

100 GTCTTGTAAT CAAGTTGGTC AGAACTCGAT TACGATTTGT AAGTAGAAAT CTAACTCACA 360 

101 TTTCGCAGAA AGTCAAACTT ACCTCTTAGT TACAACTCAA AAATTTCCTA GCCTTTTCAG 420 

102 ATCCTTAAGC ATACATATTT TGTTTAAACC GATTGTGTCC GGTGTTTGGT GTGGAGCCAT 480 

103 TGATCCGAGT GGTCAATATG TGATTGTTCG CCAAACAGTG TATGTAGGTC TAAACGGGGA 540 

104 GTGCTACAAA AGACCATACC CGAAACGAGT GCCTAAGTGT TTTGGTTATC AACCAGGTAA 600 

105 GCTATGAGAA AGCCCAGCCA TAAATGGGGT TAGGTTGAAG CAAGTCTTCA TATGGTGCGA 660

106 CACAAGGGGT GTAGTAGGGT GTCGTCAAAC TGAAAGGTTT GATAGCTCTA AGCTTGTGCT 720

107 TCTGTGGGTC AAGCCTCAAG TGCTGATCTG TGGTGTCGTC TACCTGATAA CTTTCACTTT 780 

108 TTCGAGTGAA ATTCAGGAGG CGAAACTATG GGTCAAGCCC AGCTTTGCTG GGGTTCGGCA 840 

109 CATCCAGCTT ACAGCATTGG TGCTCTTGCG AAGCTGAAGC ACAAAAATCT AATCCAGGGT 900 

110 TTGGGTTTTT TATACCAGAA GCAAAACAAA AAAATAAAAC AAAGAAAAAT TTTCGAGCGA 960 

111 AAAAATATTT TGGAATTTTT TAAAGGCGAT ACTTGCTACC GCACTTTTGC CATATTTAAA 1020 

112 ACCTGACTAT CTTTATAAGT TAATAGATAT ATCCGTTAGA TTATAAAGTA TGTTAAAAAC 1080 

113 GAGTAAAAAC AATAACTTAT ATATTTAATT CTGAATTATA TTTGACAGTG ATTATTTAAT 1140 

114 ATATTAAGAG ATATATCTAT TAGCTTAAAT ATAACTAAAA AAAGAGGTAA ATATATGGAT 1200

115 TGTGTATTTA AAAAAGCATT AGAAAATGAA ATAGAACATT ATAAAAAAGA CGGTGATATC 1260 

116 AAATCTTTCT TACAATACTT GCATTACTTT GATATAGATA AAGCATTAAA TGGTGATGAA 1320 

117 TGTGGCGATA TTATAAACTC AAATTTATCC ATTGATGAAA GTTTTGATCT TCTTGATGTT 1380 

118 GAGCACAATT TCGGCTGGGC TTTCAATAAA ATAATACAGA GACGAAATGA ATATTTATCA 1440 

119 TCAGCTAAAA CTGAAAATGA TTTTAAAAAA TACTCGTTCT TTATTCATTC GATCAATTGG 1500 

120 GAAGAATTTA ATTACGATGA GATGAGTACA ATACATCAAG AAATGATTAA AGGATTAGAT 1560 

121 AATTACACAT ATGGAGAAAT AACCATATGA ATAATAAAAT AAGAGAATAT ATTGATTTCG 1620 
122 AAATAACAAA AGATATAAAA GAAAGTCAGC TCTTAAAAAT ATCTGCATTG ATCGATGTTT 1680
123 TAAAAGTAGA TGAAAAATTT ATTGATGAAG AGGATTTGCA ACTAAAGATA TTGAAAATAT 1740

124 CGTATGAAAA TCCTATTGAT GATCCAGATG ATGGCATAAG AAAATCACAA TTCGCACGAA 1800 

125 GAAATGCCTA TGCTTTCCGC ATTAAAAAAA CAAGCAAAAA GAGATCT 1847 
126 

127 (2) INFORMATION FOR SEQ ID NO: 3:

128

129 (i) SEQUENCE CHARACTERISTICS:



134 

135 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

136 

137 Asn Phe Leu His Trp Lys Phe Glu Lys Ile Arg Gln Lys Lys Leu Lys

138 1 5 10 15
139

140 Trp Lys Lys Tyr Tyr Tyr Lys Lys Arg Arg Ser Asp Met Asp Phe Lys

141 20 25 30
142

143 Ser Arg Lys Leu Thr Leu Asn Glu Lys Lys Asp Leu Glu Lys Ile Tyr

144 35 40 45

145

146 Ala Glu Ser Glu Leu Lys Ala Lys Lys Leu Gly Thr Gln Pro Gly Val

147 50 55 60
148

149 Val Leu Glu Met Thr Met Lys Glu Met Met Lys Asn Ile Asn Leu Asp
150 65 70 75 80

151

152 Val Asn Glu Glu Thr Ala Gly Gln Tyr Arg Lys Leu Phe Lys Asn Lys




PAGE: 4 RAW SEQUENCE LISTING DATE: 09/09/98

PATENT APPLICATION US/08/876,132 TIME: 15:03:09

INPUT SET: S28494.raw

[...] Cys

[...] Gly Thr Arg [...] Ser Phe [...] Lys [...] Arg [...] Ala [...] Arg Phe Cys

[...] Ile Cys Glu Arg Ile Gln Gln Leu Arg Lys Glu Ala Asp Asn Ala Arg

162